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A METHOD FOR MINIMIZING SPILL IN CODE SCHEDULED BY A LIST SCHEDULER 

FIELD OF THE INVENTION 

The present invention relates to compilers, and more particularly to list 
schedulers used in association with compilers. 

BACKGROUND OF THE INVENTION 

When computer program source code is compiled, the result is a list of machine 
instructions capable of execution by a target processor The program which converts 
source code to machine instructions is referred to as a compiler. 

Most compilers incorporate a list scheduler. A list scheduler orders machine 
instructions in an attempt to improve the efficiency of their execution. The efficiency of 
a particular ordering of machine instructions is typically measured by the amount of time 
required for a target processor to execute the machine instructions. The list scheduler 
typically reorders an interim ordering of instructions that is generated during a first pass 
of the compilation process. 

The behaviour of a list scheduler is governed by one or more heuristics. A 
heuristic is a particular method of ordering machine instructions. An example of a 
commonly used heuristic is the Critical Path Heuristic. Heuristics may be applied by a 
list scheduler individually or in combination. 

When a set of machine instructions which has been ordered using known 
heuristics is executed, the registers of the target processor can become overcommitted. 
A processor's registers are said to become overcommitted when a machine instruction 
calls for the use of a register, but all of the processor's registers are already in use. In 
this situation, "spill code" is typically generated which causes one or more registers to 
be vacated, so that program execution may proceed. Vacating a register typically 
entails storage of its contents to volatile memory and, after the register has been used 
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for the execution of a machine instruction, restoration of its previous contents from 
volatile memory. Spill code tends to degrade efficiency, as each volatile memory 
access can waste a significant number of clock cycles. This is especially true in respect 
of modern processors whose speeds may be significantly faster than the speeds 
5 associated with volatile memory. 

What is needed is a heuristic which addresses the above noted difficulty. 

SUMMARY OF THE INVENTION 

A method of ordering machine instructions to reduce spill code involves, for each 
machine instruction that is ready for scheduling, determining an amount by which the 

10 size of a committed set of machine instructions would increase upon the scheduling of 
the machine instruction; and scheduling the machine instruction for which the 
determined amount is smallest. The currently committed instructions may be 
determined to be the machine instructions that are already scheduled as well as the 
machine instructions that are descendent from already scheduled machine instructions. 

15 The result is that new computations upon which a target processor will embark tend to 
be deferred. Bit vectors may be employed for efficiency during the assessment of 
candidate instructions that are ready for scheduling. The method may be triggered 
when the risk of registers becoming overcommitted becomes high, as may occur when 
the number of available processor registers drops below a certain threshold. 

20 In accordance with an aspect of the present invention there is provided a method 

of determining an execution order for machine instructions to reduce spill code, 
comprising: from machine instructions that are ready for scheduling, scheduling the 
machine instruction for which an amount by which the size of a committed set of 
machine instructions would increase upon the scheduling of said machine instruction is 

25 smallest. 
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In accordance with another aspect of the present invention there may be 
provided a computer program product having media including computer programmed 
instructions for directing a computing device to implement the above method. 

In accordance with another aspect of the present invention there may be 
provided a computing device comprising a processor and persistent storage memory in 
communication with said processor storing processor readable instructions for directing 
said device to undertake the above method. 

Other aspects and features of the present invention will become apparent to 
those ordinarily skilled in the art upon review of the following description of specific 
embodiments of the invention in conjunction with the accompanying figures. 

BRIEF DESCRIPTION OF THE DRAWINGS 

In the figures which illustrate an example embodiment of this invention: 

FIG. 1 is a schematic diagram illustrating a compiler and list scheduler exemplary 
of the present invention; 

FIG. 2 is a data dependence graph (DDG) illustrating dependencies between 
machine instructions shown in FIG. 1 ; and 

FIG. 3 is a flowchart illustrating operation of the list scheduler of FIG. 1. 

DETAILED DESCRIPTION 

Referring to FIG. 1, a compiler 20 for compiling computer program source code 
10 is illustrated. Compiler 20 incorporates a list scheduler 24 exemplary of an 
embodiment of the present invention. Compiler 20 and list scheduler 24 may be loaded 
for execution by a computing device (not illustrated), such as a PC, from a computer 
readable medium, such as a removable magnetic or optical disk 40. The computing 
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device has a target processor, such as an Intel® Pentium® 4 processor, with which the 
compiler 20 is compatible, as well as volatile memory (also not illustrated). The 
processor has a set of registers. 

The compiler 20 accepts source code 10 to be compiled. The source code 10 
may be written in a higher level programming language such as C. In the present 
embodiment, source code 10 contains the following three instructions: 



Do i = 1 to n 

A[i] = A[i] + B[i] 
C[i] - D[i] + E[i] 



The compiler 20 generates an interim set of machine instructions 22 from source 
code 10 in a first pass of the compilation process. The machine instructions 22 that are 
generated from the above-noted higher level language instructions are set out below: 



(1) LD rl = A[i] 

(2) LD r2 - B[i] 

(3) LD r4 = D[i] 

(4) LD r5 = E[i] 

(5) ADD r3 = rl + r2 

(6) ST A[i] = r3 

(7) ADD r6 = r4 + r5 

(8) ST C[i] = r6 

(9) BRANCH (1) (if counter >0) 



In the machine instructions 22, the labels (1) to (9) are understood not to be part 
of the instructions, but rather have been applied to facilitate reference to individual 
machine instructions in the present description. LD is a load instruction used to load a 
register with a value from volatile memory, ADD is an add instruction used to add 
together two register values, and ST is a store instruction used to store a register value 
to a volatile memory location. It is understood that the store at (8) increments i and the 
branch decrements a loop counter. 
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Certain dependencies inhere in the machine instructions (1) to (9) from the use of 
common processor registers. For example, machine instruction (5) is dependent on 
machine instructions (1) and (2) because machine instruction (5) may not add together 
the values of registers r1 and r2 with a correct result until these registers have been 
loaded by machine instructions (1) and (2) (respectively). 

FIG. 2 illustrates the dependencies between machine instructions (1) to (9) in the 
.form of a data dependence graph (DDG) 50. Each machine instruction (1) to (9) is 
represented by a node in the DDG 50. Directed edges (i.e. arrows) between nodes 
denote dependencies (i.e. ordering constraints) between machine instructions: a 
machine instruction at the head of an arrow may not be executed with a correct 
programmatic result until the machine instruction at the tail of the arrow has completed. 
The machine instruction at the head of an arrow is said to be "dependent" on the 
machine instruction at the tail of an arrow. The machine instructions along a path of 
directed edges originating from a machine instruction are referred to as "descendants" 
of that machine instruction. For certainty, it is noted that a machine instruction may 
have descendent machine instructions regardless of whether or not a DDG has been 
created. 

Each directed edge has associated with it a hardware delay measured in a 
number of hardware cycles required for the instruction at the tail of the arrow to 
complete on the target processor. In the DDG 50, the delay associated with each 
directed edge is 2 cycles. 

Referring again to FIG. 1, the interim set of machine instructions 22 is passed to 
the list scheduler 24. The purpose of list scheduler 24 is to reorder machine instructions 
22 in an attempt to improve the efficiency of their execution. The list scheduler 24 
applies a heuristic to the instructions 22 in accordance with an embodiment of the 
present invention in furtherance of this goal. A reordered set of machine instructions 30 
results. 
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In overview, application of the heuristic entails an iterative sequence of: 
evaluating which machine instructions are presently committed (i.e. which instructions 
descend from instructions which have already been reordered, that is, scheduled for 
execution); evaluating which machine instructions would be committed if each of the 
machine instructions that is ready for scheduling were to be scheduled as the next 
machine instruction; and scheduling as the next machine instruction the machine 
instruction which results in the smallest increase in the number of committed machine 
instructions. This approach effectively defers new computations upon which the target 
processor embarks, and thus tends to minimize the demand for registers at any given 
time. 

Operation 300 of the heuristic is illustrated in FIG. 3. The objective of the 
operation 300 is to reorder the set of machine instructions 22 into a new, more efficient 
execution order in machine instructions 30. The set of machine instructions 30 may 
initially be viewed as an empty list having as many empty machine instruction slots to 
be filled as there are machine instructions to be reordered (i.e. nine): 

d') 

(2') 

(3') 

(4') 

(5') 

(6') 

(7') 

(8') 

(9') 



The reordered instructions are labeled (1') to (9') above to distinguish them from 
the labels (1) to (9) used for machine instructions 22. 

A "Ready set" of instructions ready for scheduling but as yet unscheduled is 
initially determined (S302). An instruction is deemed ready for scheduling if it can be 
scheduled as the next instruction in machine instructions 30 without resulting causing 
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the executable program code to be incorrect (e.g. as would be the case if an ADD 
instruction were to be scheduled ahead of a LD instruction which loads a register to be 
added). This definition of "ready" should be distinguished from "data ready", which 
describes an instruction that is ready for scheduling because it is not dependent on any 
other instruction that has not completed. 

In the present example, the Ready set initially contains machine instructions (1) 

to (4): 

Ready = {1,2,3,4} 

Next, for each machine instruction X of the instructions (1) to (9) to be scheduled, 
a function Below(X) is computed and the result stored (S304). Below(X) is a function 
which identifies X and X's descendants. The Below(X) values will be used during the 
reordering process during assessment of candidate instructions for scheduling. The 
Below(X) values are each stored as a bit vector with one bit for each instruction to be 
reordered. The use of such bit vectors during operation 300 is for the purpose of 
limiting computational cost. The ordinal position of each bit indicates the number of the 
represented machine instruction. A bit which has been set ("1") indicates inclusion 
within a set while a bit which has been reset ("0") indicates exclusion from the set: 



Below (1) 




{1,5,6, 


9} 


or 


100011001 


Below (2) 




{2,5,6, 


9} 


or 


010011001 


Below (3) 




{3,7,8, 


9} 


or 


001000111 


Below (4) 




{4,7,8,9} 


or 


000100111 


Below (5) 




{5,6,9} 




or 


000011001 


Below (6) 




{6,9} 




or 


000001001 


Below (7) 




{7,8,9} 




or 


000000111 


Below (8) 




{8,9} 




or 


000000011 


Below (9) 




{9} 




or 


000000001 



Subsequently, a set of committed instructions (the "Committed" set) is 
determined (S306). The Committed set includes all machine instructions that have 
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been scheduled (i.e., reordered into a "slot" in machine instructions 30) as well as all 
machine instructions descendent from instructions that have been scheduled. The 
Committed set is represented as a bit vector similar to the Below(X) bit vectors. The 
Committed set is determined by computing Below(Y) for each instruction Y that the list 
scheduler has already scheduled and taking a bitwise OR (e.g. as represented by a 
single vertical bar) of these values. At the present stage, the Committed set is empty as 
no instructions have yet been scheduled: 

Committed = { } or 000000000 

Accordingly, operation at S308 determines that uncommitted machine 
instructions do in fact remain. 

Next, the bitwise complement of the Committed bit vector is taken (S309). This 
value is stored for later use during assessment of machine instructions for scheduling: 

! Committed - 111111111 

A variable BEST_SO_FAR is then initialized to -1 (S310). The BEST_SO_FAR 
variable will be used to identify the machine instruction that is the best candidate for 
scheduling into the next open slot of machine instructions 30. 

As all of the instructions of the Ready set are as yet unprocessed (S312), 
machine instruction (1) is identified as the next ready instruction (S314). 

A bitwise AND is computed between the previously computed values of Below(1) 
and !(Committed), and a population count of the resultant bit vector is taken and 
assigned to a temporary variable Z (S316). A population count operator, represented 
by a pair of vertical bars, counts the number of "1s M in a bit vector: 
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Z = I Below (1) & [(Committed) I 
= | 100011001 & 111111111 I 
= | 100011001 I 
= 4 



Z represents the number of previously uncommitted instructions which would 
become committed if machine instruction (1) were to be scheduled into slot (1 ')■ 

Because BEST_SO_FAR still has its initial value of -1 (S318), it is set to the Z 
value of 4 (S320). A BESTJNSTRUCTION variable representative of the best 
candidate for scheduling into the next available slot (1*) is set to indicate machine 
instruction (1) (S322). 

The operation of S312-S322 is repeated for each remaining machine instruction 
(2), (3), and (4) of the Ready set. In the result, the BEST_SO_FAR value is left with a 
value of 4 and the BESTJNSTRUCTION VALUE remains (1), because none of the 
other ready machine instructions (2), (3) or (4) would result a smaller increase than 4 in 
the number of members of the Committed set. Actually, because all four instructions (1) 
to (4) would increase the Committed set by four, they are all equally good candidates for 
scheduling into slot (1 '). In the result, machine instruction (1) is scheduled into slot (1 ') 
machine instructions 30 (S324): 



(1') LD rl = A[i] 

(2') 

(3') 

(4') 

(5') 

(6') 

(7') 

(8') 

(9') 



The Ready set is updated to remove instruction (1) to reflect this scheduling 
(S326). 
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Ready = {2,3,4} 



It is noted that instruction (5) is not added to the Ready set, as an incorrect 
programmatic result would occur if instruction (5) were to be scheduled ahead of 
instruction (2), which is as yet unscheduled. 

Operation continues (S306, S308, S309) with the following computations: 



Committed = Below (1) = {1,5,6,9} or 100011001 
! Committed « 011100110 



Three passes through operation S3 12 to S322 (one for each of instruction (2) to 
(4) of the Ready set) yields the following computations: 

if (2) were to be scheduled next: 

Committed would = Below (1) | Below (2) 

- {1,5,6,9} | {2,5,6,9} 
= {1,2,5,6,9} 

representing an increase Z - 1, 

computed as follows: 

Z = | Below(2) & [(Committed) | 

= I 010011001 & 011100110 | 

= I 010000000 I 

= 1 

if (3) were to be scheduled next: 

Committed would = Below (1) I Below (3) 
= {1,5,6,9} | {3,7,8,9} 

- {1,3,5,6,7,8,9} 
representing an increase Z = 3, 
computed as follows: 

Z = | Below(3) & ! (Committed) I 
= I 001000111 & 011100110 I 
= I 001000110 I 
= 3 

if (4) were to be scheduled next: 

Committed would - Below (1) | Below (4) 
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= {1,5,6,9} | {4,7,8,9} 
= {1,4,5,6,7,8,9} 
representing an increase Z = 3, 
computed as follows: 
Z = | Below (4) & ! (Committed) I 
= I 000100111 & 011100110 | 
= I 000100110 | 
= 3 

Z = smallest if (2) is scheduled 
Conclusion: schedule (2) 



Thus, machine instructions (2) is scheduled in slot (2') of machine instructions 30 
(S324), as it results in the smallest increase in the Committed set: 



(1* 
(2* 
(3' 
(4' 
(5' 
(6' 
(7* 
(8- 
(9' 



LD 
LD 



rl = 
r2 = 



A[i] 
B[i] 



The Ready set is updated to remove instruction (2) and to add instruction (5), 
which may now feasibly be scheduled without causing an incorrect programmatic result 
(S326): 

Ready = {3,4,5} 

Operation continues at S308 to S324 with the following computations and 
conclusion (bit vector calculations are omitted for brevity): 



CA9-2003-0011 



[ 



CA 02428284 2003-05-07 



Committed = {1,2,5,6,9} 

if (3) were to be scheduled next: 

Committed would = {1,2,3,5,6,7,8,9} 
representing an increase Z = 3 

if (4) were to be scheduled next: 

Committed would = {1,2,4,5,6,7,8,9} 
representing an increase Z = 3 

if (5) were to be scheduled next: 
Committed would = {1,2,5,6,9} 
representing an increase Z * 0 smallest* *) 

Conclusion: schedule (5) 



The Ready set is updated (S316): 



Ready = {3,4,6} 



Subsequently, it is determined that the Committed set will stay the same (S306, 
S308, S309): 



Committed = {1,2,5,6,9} 



Operation at S3 10 to S324 results in the scheduling of instruction (6) next, for 
similar reasons to the scheduling of (5). The set of scheduled machine instructions 30 
is thus now: 



(1') LD rl = A[i] 
(2') LD r2 = B[i] 
(3' ) ADD r3 = rl + r2 
(4') ST A[i] = r3 



CA9-2003-001 1 



-12- 



I 



CA 02428284 2003-05-07 



(5') 

(6') 

(7') 

(8') 

(9') 

and the Ready set is left with only two instructions: 

Ready = {3,4} 
Operation continues at S306 to S324 with the following conclusion: 

Committed = {1,2,5,6,9} 

if (3) were to be scheduled next: 

Committed would = {1,2,3,5,6,7,8,9} 
representing an increase Z = 3 

if (4) were to be scheduled next: 

Committed would = {1,2,4,5,6,7,8,9} 
representing an increase Z = 3 

i.e. the two instructions are equally good candidates 

Conclusion: schedule (3) 

Updating of the Ready set (S316) leaves only one instruction: 

Ready = {4} 

thus machine instruction (4) is therefore scheduled next. 

In the subsequent operation, it is determined that no uncommitted instructions 
remain (S308) and that all instructions remaining to be scheduled would therefore 
increase the Committed set by the same amount (i.e. zero). As a result, the remaining 
unscheduled instructions (7), (8), and (9) may be scheduled in any manner that does 
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not produce an incorrect programmatic result (S328). The final reordered set of 
machine instructions 30 is thus: 

(1') LD rl = A[i] 

5 <2') LD r2 = B[i] 

(3') ADD r3 = rl + r2 

(4') ST A[i] = r3 

(5') LD r4 = D[i] 

(6') LD r5 = E[i] 

10 (7') ADD r6 = r4 + r5 

(8') ST C[i] = r6 

(9') Branch (1) 

Operation 300 is thus concluded. 

15 It is noted that operation 300 may be triggered by the list scheduler 24 only when 

the risk of registers becoming overcommitted becomes high. This may be the case 
when the number of available registers drops below a certain threshold, e.g., as may be 
assessed by counting the number of extant live ranges (i.e. the number of registers 
presently in use over some range of instructions). In such cases, an alternative 

20 heuristic may be employed prior to the triggering of the present heuristic. 

It is noted that machine instructions 30 may in some cases turn out to have the 
same order as machine instructions 22 where no efficiency gains can be realized 
through application of the heuristic. 

As will be appreciated by those skilled in the art, modifications to the above- 
25 described embodiment can be made without departing from the essence of the 
invention. For example, the use of bit vectors by the list scheduler is optional, although 
typically advantageous for increasing the efficiency of operation 300, especially when 
the set of machine instructions 22 to be reordered is large. 

In another alternative, the function Below(X) employed during operation 300 may 
30 include only the descendents of instruction X and not instruction X itself as described 
above. In this case the Committed set might instead be referred to as the 
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ToBeCommitted" set as it would not include instructions that have already been 
scheduled. Such an embodiment would operate similarly to the above-described 
embodiment, except that the computed Z values would all be reduced by 1. The 
ultimate scheduling of machine instructions 30 would be the same. 

5 For instance, in an exemplary second pass through operation S306 to S324 in 

such an alternative embodiment, the following calculations would occur 

ToBeCommitted = Below (1) = {5,6,9} or 000011001 
! ToBeCommitted = 111100110 

10 Three passes through operation S312 to S322 (one for each of instruction (2) to 

(4) of the Ready set) would yield the following computations: 

if (2) were to be scheduled next; 

ToBeCommitted would = Below (1) | Below (2) 

= {5,6,9} | {5,6,9} 
15 = {5,6,9} 

representing an increase Z = 0, 
computed as follows: 
Z = | Below (2) & ! (ToBeCommitted) I 
= | 000011001 & 111100110 i 
20 = | 000000000 | 

= 0 

if (3) were to be scheduled next: 

ToBeCommitted would = Below (1) | Below (3) 
25 = {5,6,9} | {7,8,9} 

= {5,6,7,8,9} 
representing an increase Z « 2, 
computed as follows: 
Z - | Below(3) & ! (ToBeCommitted) i 
30 = | 000000111 & 111100110 I 

= | 000000110 | 
= 2 

if (4) were to be scheduled next: 
35 ToBeCommitted would = Below (1) | Below (4) 

- {5,6,9} | {7,8,9} 
= {5,6,7,8,9} 
representing an increase Z = 2, 
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computed as follows: 

Z = | Below (4) & ! (ToBeCommitted) | 
= | 000000111 & 111100110 | 
= | 000000110 | 
= 2 

Z = smallest if (2) is scheduled 
Conclusion: schedule (2) 

Thus, the same conclusion as was reached in this pass of the first-described 
embodiment, i.e. that machine instruction (2) should be scheduled next, is reached in 
this embodiment. 

For clarity, it is noted that the terminal condition evaluated in S308 of such an 
alternative embodiment would need to assess whether uncommitted and unscheduled 
instructions remain, rather than just whether uncommitted instructions remain. 

Other modifications will be apparent to those skilled in the art and, therefore, the 
invention is defined in the claims. 
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What is claimed is: 

1 . A method of determining an execution order for machine instructions to reduce spill 
code, comprising: 

from machine instructions that are ready for scheduling, scheduling the machine 
instruction for which an amount by which the size of a committed set of machine 
instructions would increase upon the scheduling of said machine instruction is 
smallest. 

2. The method claim 1 , wherein said committed set of machine instructions includes 
any machine instruction that is already scheduled and any machine instruction that 
is descendent from an already scheduled machine instruction. 

3. The method claim 2, wherein said amount is determined by, for said each machine 
instruction ready for scheduling: 

identifying descendent machine instructions of said each machine instruction; 
and 

determining which of said descendent machine instructions and said each 
machine instruction is not in said committed set of machine instructions. 

4. The method claim 1, wherein said committed set of machine instructions includes 
any machine instruction that is descendent from an already scheduled machine 
instruction. 
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5. The method claim 4, wherein said amount is determined by, for said each machine 
instruction ready for scheduling: 

identifying descendent machine instructions of said each machine instruction; 
and 

determining which of said descendent machine instructions is not in said 
committed set of machine instructions. 

6. The method claim 1 , wherein a given machine instruction is considered ready for 
scheduling when scheduling of said given machine instruction as a next machine 
instruction would not cause an erroneous programmatic result. 

7. The method of claim 1 , wherein said method is undertaken when a risk of register 
overcommittedness exceeds a certain threshold. 

8. The method of claim 7, wherein said threshold is exceeded when processor register 
availability drops below a particular threshold. 

9. A method of determining an execution order for machine instructions to reduce spill 
code, comprising: 

in a first bit vector containing one bit to represent each machine instruction to be 
scheduled, setting those bits for which the represented machine instruction is not 
committed, and resetting the remaining bits; 

for said each machine instruction to be scheduled that is ready for scheduling: 
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in a second bit vector also having one bit to represent each machine instruction 
to be ordered in the same sequence as in said first bit vector, setting those bits 
for which the represented machine instruction is a descendant of said each 
machine instruction that is ready for scheduling, and resetting the remaining bits; 

performing a bitwise AND operation of said first bit vector and said second bit 
vector to create a third bit vector; and 

determining the number of set bits in said third bit vector; and 

selecting for execution the machine instruction for which said third bit vector contains 
a minimum number of set bits. 

10. The method of claim 9 further comprising, prior to performing said bitwise AND 
operation, setting in said second bit vector the bit for which the represented machine 
instruction is said each machine instruction that is ready for scheduling. 

1 1 .A computer program product having media including computer programmed 
instructions for directing a computing device to implement the method of any of 
claims 1 to 10. 

12. A computing device comprising a processor and persistent storage memory in 
communication with said processor storing processor readable instructions for 
directing said device to undertake the method of any of claims 1 to 10. 
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ABSTRACT 

A method of ordering machine instructions to reduce spill code involves, for each 
machine instruction that is ready for scheduling, determining an amount by which the 
size of a committed set of machine instructions would increase upon the scheduling of 
5 the machine instruction; and scheduling the machine instruction for which the 
determined amount is smallest The currently committed instructions may be 
determined to be the machine instructions that are already scheduled as well as the 
machine instructions that are descendent from already scheduled machine instructions. 
The result is that new computations upon which a target processor will embark tend to 
10 be deferred. Bit vectors may be employed for efficiency during the assessment of 
candidate instructions that are ready for scheduling. The method may be triggered 
when the risk of registers becoming overcommitted becomes high, as may occur when 
the number of available processor registers drops below a certain threshold. 
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